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ABSTRACT 

A  series  of  15  successive  5-day  mean  500mb  height  fields  in  grid- 
print  form  covering  the  quasi-hemispheric  area  between  14.8  -  70. 2N 
latitude  was  used  in  this  experiment.   The  time  period  extended  be- 
tween 15  May  1966  through  28  July  1966,   The  stepwise  regression  pro- 
cedure was  employed  to  derive  statistical  estimator  equations  for 
each  mean  map  field  Zc(I,J,t+l)  in  terms  of  the  preceding  field  data 
Zc(I,J,t)  as  the  primary  predictor,  (t+1  =2,  ...,  15).  As  addition- 
al predictors,  the  5-day  mean  composited  gridpoint  values  of  the 
NIMBUS  II  MRIR  equivalent  black  body  temperatures  in  the  water-vapor 
channel  (denoted  T^ )  and  the  window  channel  (denoted  T-)  were  intro- 
duced for  both  analyses  times  t  and  t+1.   The  five-predictor  regres- 
sion equations  thus  developed  for  Zc(I,J,t+l)  proved  to  have  sub- 
stantial statistical  significance,  both  from  the  standpoint  of  the 
specification  and  of  prediction  of  Zt.(I,J,t+2)  valid  five  days  later 
(t+2  =3,  ...,  15).   In  the  latter  connection,  the  prognosis  of 
Zc(t+2)  was  significantly  improved  when  all  five  predictors  were 
included,  as  compared  with  the  case  when  the  data  of  either  radio- 
metric channel  T-(I,J)  or  T-(I,J)  at  times  t+1,  t+2  was  deleted  from 
the  prediction  equations. 
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1,.  Introduction 

The  objective  of  this  study  was  to  determine  the  usefulness  of 
introducing  satellite  radiometric  data  in  developing  improved  regres- 
sion equations  for  the  prognosis  of  the  500mb  contour  height  field. 
The  measure  of  the  improvement  is  based  upon  a  comparison  with  the 
skill  indicated  by  use  of  a  simple  lag-correlation  analysis  between 
Zj.(t+l)and  Zc(t)  over  all  grid  points  of  the  prediction  area. 

A  series  of  15  successive  5-day  mean  500mb  Northern  Hemisphere 
analyses  were  provided  on  magnetic  tape  through  the  kind  auspices  of 
the  Extended  Forecast  Section  of  the  U.S.  Weather  Bureau.   These 
analyses  ranged  over  the  successive  non-overlapping  periods  15  -  19 
May  1966  (denoted  Z^(l)),  and  on  in  order  to  the  final  analysis  of 
24  -  28  July  1966  [denoted  Z  (t=15)].   This  set  of  15  analyses  pro- 
vided 15  fields  Zc(I,J,t),  (t=l,  ..,.,  15),  where  I, J  are  the  geo- 
graphic grid-point  identifiers. 

The  rationale  of  employing  5-day  mean  analyses  in  this  experi- 
ment was  based  upon  the  consideration  that  the  composited  5-day  mean 
radiometric  data  fields  of  NIMBUS  II  over  identical  time  periods 
would  be  obtained  with  as  closely  comparable  a  degree  of  feature- 
smoothing  as  is  involved  in  the  Extended  Forecast  Section's  analysis 
procedure  for  deriving  Z_(I,J,t). 

The  5-day  mean  radiometric  fields  employed  were  the  grid-point 
composited  values  of  equivalent  black  body  temperature  T   in  the 
medium  resolution  infrared  radiometer  (MRIR)  channels  1  and  2  of 
NIMBUS  II  (c.f.,  NIMBUS  II  User's  Guide,  1966).   For  this  satellite, 
these  two  channels  were  also  called  the  water -vapor  channel  (6.4  - 
6.9|Jm)  and  window-channel  (10-Il(jm),  respectively.   With  this  under- 
standing, these  fields  were  denoted  T. (I,J,t)  and  T  (I,J,t),  respec- 
tively.  The  full  lifetime  of  75  days  of  observational  data  of  the 
NIMBUS  II  MRIR  scanning  radiometer  was  available  for  this  experiment. 


thus"  accounting  for  the  duration  of  the  test  and  the  number  of  analyses 
involved  (t=l,  ....,  15).   The  composited  radi(»etric  data  were  provided 
in  hard- copy  form  through  the  generous  cooperation  of  the  National 
Space  Science  Data  Center. 

Other  radiometric  fields  from  NIMBUS  II  could  have  been  included 
in  the  regression  study.  However,  in  an  earlier  study  (Martin,  1969) 
showed  that  channel  4,  which  sensed  in  the  spectral  range  5-30|jm,  was 
virtually  redundant  with  channel  2  since  there  existed  a  spatial  corre- 
lation coefficient  rCTj,  T.)  »  .98.  Furthermore,  the  choice  of  the 
two  infrared  channels  used  here  seems  to  afford  the  best  possibility  of 
continuity  for  application  of  the  method  developed  to  future  generations 
of  polar  orbiting  satellites  planned  for  the  1970' s. 
2.  The  data. 
a.   The  time  series. 

Three  separate  time-series  of  analysed  5-day  mean  map  fields  were 
processed  into  similar  grid-map  arrays  for  each  of  the  identical  analy- 
sis times  t  (t=l,....,  15): 

t  =  1,  OOGMT,  20  May  1966;  t  =  2,  25  May  1966 

t  =  3,       30  May  1966;  t  =  4,  4  June  1966 


t  =  9,        29  June  1966 ;t  =10,  4  July  1966 

t  =11,         9  July  1966 ;t  =12,  14  July  1966 

t  =13,        19  July  1966 ;t  =14,  24  July  1966 

t  =15,  OOGMT,  29  July  1966. 

Each  analysis  time  specified  represents  the  averaging  period  over 
the  just-preceding  five  days  (e.g.  t=l  denotes  an  average  over  15-19 
May,  which  may  be  characterized  as  the  date-time  analysis  for  OOGMT, 
20  May  1966). 
b.   Geographic  extent  of  the  original  fields. 

(1)  The  5-day  mean  fields  of  500mb  geopotential  height  were  ini- 
tially in  the  usual  NMC  polar  stereographic  map-projection  coordinates 
(i,j). 


(2)  The  5-day  mean  fields   T.(I,J,t)  were  quasi-global  spanning 
the  region  70S-70N  in  Mercator  projection  coordinates  (I, J)  at  5 
longitude  intervals. 

(3)  The  field  T«(I,J,t)  covered  the  same  region  as  T, (I,J,t). 
The  Mercator  grid  points  (I, J)  were  converted  into  longitude  X  and 
latitude  cp  through  the  formulas 

X  =  30°+  (I  -  l.)5°         1  =  01, ,73  (1) 

InC^"^^^^  =  (21.  -  J)(5Tr/l80)       J  =  01, ,41  (2) 

Eq.  (1)  gives  longitude  X  in  degrees  east  of  Greenwich,  while 
(2)  gives  cp  (in  radians)  on  a  non-linear  scale  which  is  defined  in 
tenns  of  J  in  Table  1.   In  this  table,  equal  magnitudes  |j-21.  | 
correspond  to  equal  latitudes  cp  relative  to  the  equator.   This  is 
indicated  in  Table  1  by  coupling  the  sjmnnetric  values  J,  42-J  under 
the  column-head  "J-value,"  with  identical  cp-values  north  or  south  in 
the  adjacent  column. 

An  example  of  the  Mercator  mapping  of  T„(I,J,t=6)  is  shown  in 
Fig.  1.   In  this  figure,  a  rather  heavy  degree  of  smoothing  has  been 
applied  to  the  original  data  over  the  73  by  41  array  but  the  smooth- 
ing merely  serves  to  highlight  the  large-scale  global  map  features, 
and  is  otherwise  irrelevant  to  the  study  outlined  here. 

In  this  study,  each  radiometric  field  was  encoded  by  rows  in  the 
form  indicated  by 

T^    [I,J,t],  I  =  1,....,  72;  J  =  41, ....,01 

As  indicated  earlier,  no  spatial  smoothing  was  performed  on  any  radio- 
metric data  array,  since  the  value  recorded  at  any  grid  point  (I, J) 
was  already  composited  of  a  large  number  of  scan-spots  within  the  5- 
degree  longitude  "square"  surrounding  (I, J).  All  fields  T-(I,J,t), 
T«(I,J,t)  were  converted  to  IBM  card  format  in  the  manner  just  suggested, 
c.   Collection  of  data-arrays  in  compatible  Mercator  format. 

Since  Z£.(i,j,t)  is  considered  to  be  a  conventional  form  of  analy- 
sis, while  T  (I,J,t),  T2(I,J,t)  are  experimental  for  the  purposes  of 


TABLE  1.   Conversion  from  J-value  on  Mercator  grid-map  to 
corresponding  latitude,  (north  or  south)  using  Eq.(2). 


Lat.(deg.) 

Lat. (deg.) 

J-value 

by  Eq.2 

J-value 

by  Eq.2 

01,41 

70.2(N,S) 

12,30 

41.0    (N,S) 

02,40 

68.4 

13,29 

37.1 

03,39 

66.5 

14,28 

33.0 

04,38 

64.4 

15,27 

28.7 

05,37 

62.2 

16,26 

24.2 

06,36 

59.8 

17,25 

19.6 

07,35 

57.2 

18,24 

14.8 

08,34 

54.3 

19,23 

9.9 

09,33 

51.3 

20,22 

5.0 

10,32 

48.1 
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44.6 
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this  study,  the  former  has  been  converted  to  the  Mercator-mapping  des- 
cribed in  equations  (1)  and  (2),  The  geometric  relationships  involved 
In  the  transformation  from  polar  stereographic  to  Mercator  coordinates 
are 

tan  (X  -  10°)  =  |5||  (3) 

and 

sin  cp=  973.71  -  [(U32,)^+  (j-32.)^]/973.71  +  (i-32.)^+  (j-32.)^iy  (4) 
Equations  3  and  4  make  it  possible  to  compute  Z_ (I, J)  at  whole  multiples 
of  5  longitude  and  at  values  J  which  are  compatible  with  cp  of  Eq.  (2), 

Because  the  conventional  NMC  polar  stereographic  chart  has  an 
equatorward  boundary  which  averages  close  to  latitude  13N,  only  linear 
interpolation  of  Zt.(I,J)  along  J  =  18  (corresponding  to  14. 8N)  could 
be  performed.  However,  for  all  J^  17,  (i.e.  cp  >  19. 6N)  all  other 
points  (I, J)  were  sufficiently  well  removed  from  the  NMC  boundary  that 
Bessel's  central  difference  interpolation  scheme  [p.  252,  Haltiner 
(1971)]  could  be  applied  at  all  other  Mercator  points.   An  example  of 
the  NMC  map  field  at  t  =6,  after  conversion  to  the  Mercator  projection 
(1),(2),  is  shown  in  Fig.  2. 

It  should  be  noted  that  the  resulting  conversion  to  Mercator  co- 
ordinates leaves  us  with  contour-height  data  defined  only  on  rows 
J=18,  17,....,  01  and  for  the  72  columns  1=01,....,  72.  Thus,  for 
purposes  of  point-by-point  regression,  all  radiometric  data  points  in 
T.(I,J,t),  T(I,J,t)  with  J  >  18  were  stripped  from  their  card  decks. 
For  each  time  t,  the  Mercator-listed  fields 

T^(I,J,t),  T2(I,J,t),  Z^(I,J,t)   J  =  18, ,  01,  I  =  01, ,  72 

t  =  1,....,  15 
vere  arranged  in  sequential  order  with  respect  to  I  and  J,  taking  care 
not  to  break  the  sequence  within  any  field.   In  the  form  just  speci- 
fied, each  field  T- ,  T^,  Z^  consisted  of  1296  geographically  ordered 
data  elements  valid  at  time  t. 


A  Fortran  program  was  devised  to  sort  all  data  elements  at  (I, J) 
to  produce  a  new  punched  card  deck  having  the  rearranged  data  array  A 
on  each  card: 

Data  Array  A  at  (I. J)'. 

T^(I,J,t),  T^(I,J,t+l),  T2(I,J,t),  T2(I,J,t+l),  Z^(I,J,t),  Z^(I,J,t+l) 

I  =  1} • • • • y  72,  J  =  18,.«*.y  01. 

Each  data  array  of  the  form  A  now  applies  spatially  over  each  of  the 
18  X  72  =  1296  Mercator  gridded  points  for  the  successive  times 
(t,  t+1) .   This  array  by  paired  times  will  be  considered  valid  at 
time  t+1  in  the  statistical  regression  experiment  to  be  performed 
in  Section  3. 
3.   The  statistical  experiments  on  the  dependent-data  arrays. 

For  each  data  array  A  spanning  the  entire  1296  grid  point  map  field 
(I, J)  at  time  t+1,  three  separate  tests  were  performed,  as  described 
below.   In  Test  1,  a  one-predictor  specification  equation  was  generated 
in  the  form 

Z^(tn)   =  A^  +  A^Z^Ct)  (5) 

whereas  in  Tests  2  and  3,  respectively,  three-  and  five-predictor 
specification  equations  were  generated  from  the  same  grid-sample  data 
base.   Thus,  the  Test  2  regression  equation  was  of  form 

Z^Ct+l)  =  B^  +  B^T^Ct)  +  B^T^Ct+l)  +  B^Z^Ct)  (6) 

while  the  Test  3  equation  was  developed  in  the  form 

Z3(t+1)  =  C^  +  C^T^(t)  +  C^T^Ct+l)  +  C^T^Ct)  +  C^T^Ct+l) 

+  C5Z3(t)  (7) 

The  regression  equations  of  form  (5),  (6),  (7)  were  derived  simply 
by  using  the  delete-variable  option  of  the  BIMED02R  stepwise  regression 
program  [designed  after  Dixon  (1966)]  on  file  in  the  program  library 
of  the  W..  R.  Church  Computer  Center  of  the  Naval  Postgraduate  School, 
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In  the  progressive  application  of  the  program  to  develop  the  regres- 
sion equations  (5),  (6),  (7)  at  each  time  t+1,  the  significance  of  the 
kth  variable  selected  could  be  assessed  using  the  F, -statistic  upon  entry 
(an  output  of  the  program  at  each  step).   This  statistic  may  be  defined  as 

j^   /^        ,  -,\_['^   cum. expl.var. , step  k]-[^  cum«expl.var.  ,step  (k-1)]      .^^ 
k   *  [%   unexplained  variance  at  step  k] 

Here,  n  is  the  number  of  degrees  of  freedom,  taken  equal  to  the  sample 
size,  n  =  1296,  Miller's  criterion  (1962)  for  significance  of  the  regres- 
sion equation  at  or  above  the  95^  confidence  level  is  that  each  predictor 
selected  yield  an  F,  (l,n-k-l)  in  excess  of  a  critical  value  at  the  kth 
step  defined  in 

^*05/(P-k«)(l>''-'^-l> 
Such  critical  values  may  be  obtained  from  tables  of  F-values  for  a  95^ 

confidence  estimate  at  each  step.   The  values  for  Tests  1  and  2  are  bounded 

(c) 
by  those  obtained  in  the  five-predictor  case  (P=5),  for  which  case  F, 

was  found  to  have  the  following  critical  values  at  step  k,  (k=l,...,5): 


Step  k  =  1        2        3        4        5 

Fi^/l,,-,  =6.63      6.24      5.76      5.02      3.84 
Cl/P-k+1 

In  each  regression  equation  developed  over  the  entire  time  span 
whether  by  Test  1,  2  or  3,  all  variables  used  in  equations  5,  6,  7  were 
found  to  be  significant,  at  least  at  the  95^  confidence  level,   A  case- 
by-case  listing  of  the  coefficients  of  the  14  regression  equations  by 
each  test  is  given  in  Appendix  Table  5,   For  the  remainder  of  this 
section,  a  brief  summary  of  the  statistics  resulting  from  the  use  of 
these  equations  will  suffice.   These  summarized  statistics  are  displayed 
in  Table  2. 
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TABLE  2.   Summary  of  the  results  of  the  specification  Tests  1, 
2,  3.   The  time  mean  values  of  the  coefficients  of  the  variable 

Z^Ct),  T^Ct+l),  T^Ct),  T^(t+1),  T^(t)  _ 

are  shown  along  with  their  standard  deviations  (S.D.),  In  addition,  R 
the  mean  fractional  explained  variance,  averaged  over  each  time  series 
and  by  each  test  is  listed. 


Test 

Series 

Mean-value 

of  coeffic: 

Lents  of  predictors  c 

>f  Eqs. (5,6,7) 

Z5(t)   T2(t+1) 

T2(t) 

T^(t+1) 

T^(t) 

Const. term 
(Rpm) . 

r2 

1 

.87A56  0 

0 

0 

0 

93.560 

.8063 

(S.D.) 

(.06116)  0 

0 

0 

0 

(46.535) 

2 

.78742  4.46333 

-2.32201 

0 

0 

-432.644 

.8413 

(S.D.) 

(.07622) (1.2540) 

(1.28331) 

0 

0 

(239.823) 

3 

.78316  6.65738 

-3.75340 

-8.47452 

5.81224 

-7.978 

.8533 

(S.D.) 

(.08019) (1.59336) 

(1.38510) 

(3.39685) 

(3.07409) 

(206.807) 

The  significance  in  each  test  of  the  multiple  correlation  coeffi- 

2 
cient  R  and  its  counterpart  R  ,  is  that  the  latter  represents  the 

fractional  explained  variance  of  Z_(t+1)  by  whichever  of  the  regression 

equations  5,6  or  7  is  under  consideration. 

The  general  significance  of  the  results  summarized  in  Table  2,  may 

be  described  by  means  of  an  F-value  characterizing  Test  1,  an  F-value 

for  the  difference  between  Tests  2  and  1,  and  a  final  F-value  for  the 

difference  between  Tests  3  and  2.   The  first  of  these  F-statistics 

resxilts  from  Eq.(8)  using  k=l,  while  the  other  two  F-values  may  be 

expressed  [after  Anderson  (I960)],  in  the  fonn 

(R?  -  Ro)/2  (R^  -  rJ)/2 

F.  o  =  —2 ——         and  F    -    "^    ^ 


^■^   (1  -  R3)/1290       3-^    (1  -  r2)/1292 

Here,  the  subscripts  5,  3,  1  signify  the  number  of  predictors  in  the 
regression  equation  under  consideration. 
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The  three  F-values  are 


F^  =  5,386.4 


^-1  =  101-8 


F^_2  =  52.8 


each  of  which  is  highly  significant  when  compared  with  the  critical 
F-level  required  at  the  99^  confidence  level. 

To  svun  up,  there  is  a  high  degree  of  specification  skill  in  the 
stepwise  addition  of  each  variable,  or  set  of  variables,  involved 
in  proceeding  from  Test  1  through  Test  3. 
4.   Application  of  the  regression  equations  to  independent  data 

In  arriving  at  the  precise  form  of  equations  5,  6,  7  at  each 
specification  time  t+1  =2,  ....,  15,  matrix  equations  of  form 

Z^Ct+l)  =  (C^,  C^,  C^,  C3,  C^,  C^)^^^      /  1  \  (9) 


5  /t+1 


have  been  derived  using  the  predictor-variables  valid  at  t+1: 
^1  "  "^l^^)*  ^2  "  T^(t+1),  X3  =  T2(t),  X^  =  T2(t+1),  X^  =  Z^Ct) 

and  with  Z(.(t+1)  as  the  predictand.   Eq,(9)  affords  a  summary  of 
each  of  three  tests  if  the  coefficients  C  j  are  properly  interpreted 
for  each  test  (see  Appendix  Table  5).   The  coefficient  matrix  deduced 
for  the  data  array  at  time  t+1  was  then  applied  to  the  data  array  at 
time  t+2  to  generate  an  estimated  field  Zc(t+2)  from  the  matrix  equation 
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ll\ 


Z^(t+2)  =  (C^,  Ci»"-»^^\+i 


h 
X, 

X, 


V'r' 


(10) 


The  prognostic  Z^-field  of  (10)  was  next  correlated  with  the  verifying 
Zj.(t+2)  giving  a  resultant  multiple  correlation  coefficient  R(Z^,Z^)   _ 
and  a  standard  error  of  estimate  O  of  the  predicted  field  given  by 


the  standard  formula 

a^(t+2)  =  a^(t+2)[l  -  R^(t+2)]^/^ 


(11) 


The  estimated  field  Z^  was  generated  from  the  data-array  A  at  time 

(t+2)  using  an  option  of  the  statistical  program  BIMED  02R,  which  also 

2   2   2 
computed  the  resulting  statistical  parameters  R  ,  R^,  R^  together  with 

the  associated  standard  errors  o  ,     The  results  using  (10)  iteratively 

e 

on  all  possible  independent  data  times  t+2  =  3,...., 15  and  for  each 
test  are  displayed  in  Table  3. 

For  comparison  purposes,  the  corresponding  dependent  sample  statis- 

2   2   2 
tics  R- ,  R-,  Rj.  and  standard  deviations  based  upon  the  results  of  the 

specification  equations  5,  6,  7  are  also  listed  for  each  test  method 

and  validity  time  t+1  =  2,..., 15.  The  results  for  these  dependent 

samples  are  those  which  were  time-averaged  in  the  preliminary  statistical 

analysis  of  Section  3. 

It  is  to  be  noted  that  in  any  time-line  (t+2),  except  the  first 

line  of  Table  3,  there  will  be  both  a  dependent-data  test  by  Eqs, 
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(5), (6),  (7)  and  a  test  on  the  independent  data,  when  the  results  of 
Eq.(lO)  are  applied  to  the  data  at  time  (t+2). 

Time-averages  of  each  column  are  also  presented  in  Table  3. 
Since  the  standard  errors  are  root  mean  square  values,  the  time- 
averaged  values  of  O    were  obtained  by  squaring  the  individual -time 
values,  and  averaging  the  squared  quantities  prior  to  recomputing 
the  root  mean  squares. 

5,   Analysis  of  the  statistical  results. 

2   2   2 

Examination  of  the  sequence  of  values  R. ,  R-,  R_  first  under  the 

headings  "dependent -data"  reveals  the  same  conclusion  as  in  Section  3; 
that  is,  the  percentage  explained  variance  increases  monotonically 
as  more  variables  are  added  to  the  specification  equation  (9) . 

A  similar  result  obtains  when  a  comparison  is  made  of  the  in- 

2   2   2 
dependent-data  verification-results  R^ ,  R_,  R^.,  by  the  different  tests 

at  time  (t+2) .   It  is  found,  as  before,  that 

R^(t4-2)  >  R3(t+2)  >  R^(t+2) 

2 

although  the  successive  differences  in  R,  are  not  quite  as  large  as 

with  the  dependent-data  differences  for  the  same  time. 

A  test  of  the  sensitivity  of  the  prediction  coefficient  matrix 
is  considered  next.   Our  remarks  will  be  directed  specifically  to  Test  3, 
although  similar  remarks  apply  to  Tests  1  and  2.  If  the  success  of 
the  prognosis  was  strongly  dependent  upon  the  coefficient  matrix  valid 
at  time  (t+1),  one  would  expect  a  greater  coherence  between  the 
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2  2 

dependent  R  (t+1)  and  the  ensuing  (independent)  R  (t+2).   In  fact, 

2 

a  comparison  of  R^.  from  the  dependent  to  independent -data  columns 

2  2 

indicates  that  there  are  four  cases  when  Rj.(t+2)  exceeds  R_(t+1); 

in  two  cases  there  is  only  a  very  slight  change,  and  in  the  other 

2 
seven  cases,  there  is  a  sizeable  decrease  in  R  in  stepping  timewise 

from  (t+1)  to  (t+2).   On  the  other  hand,  there  is  always  a  slight 

shrinkage  at  the  same  time  t+2  in  proceeding  from  the  dependent-data 

to  the  independent-data  samples. 

This  combination  of  results  indicates  that  the  successive- time 

regression  coefficients  (C  ,  C  ,..,., Cc)  frequently  did  not  pass 

outside  of  the  standard  error  involved  in  their  determination.   For 

2 
example,  in  the  Test  1  cases,  R  (t+2)  based  upon  both  the  dependent- 
data  specification  and  the  independent  data-verification  which  used 
the  coefficient  matrix  at  time  (t+1),  were  identical  to  four  signi- 
ficant digits. 

A  separate  test  of  the  inertia  of  the  coefficient-matrix  was 
conducted  using  three-period  pooled  specification  equations  of  form  (7) 
to  determine  the  coefficient  matrix  for  prognosis  at  time  (t+2) ,   For 
example,  the  pooled  sample  of  3988  data  elements  from  times  2,3,4 
were  used  to  provide  a  coefficient  matrix  to  test  the  prognosis 

Z^(t+2=5),  and  so  on  iteratively  throughout  the  data  period  to  t+2=15. 

there 
In  no  case  was/any  significant  variation  in  explained  variance  of  the 

prognosis  arising  from  the  use  of  the  three-period  coefficient  matrix 
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sets  as  compared  with  the  single-period  coefficient  matrix  from  time 
t+1  =4,  ....,15,  Most  of  the  success  in  the  prognosis  procedure  must, 
therefore,  be  due  to  the  quality  of  the  dependent-data  sample  speci- 
fication, that  is  in  the  sample  linear  correlations,  and  partial  cor- 
relations which  relate  Z_(t+2)  to  the  five  variables  X-,....,Xc 
identified  just  below  Eq.(9),  but  now  considered  valid  at  time  t+2. 
In  spite  of  the  conclusion  that  the  linear  regression  prognosis 
model  (10)  has  coefficients  which  are  temporally  smoothed,  it  is  still 
of   considerable  value  to  determine  the  relative  usefulness  in  the 
prognostic  procedure  of  the  various  prediction  equations  identified 
as  relevant  to  Tests  1,  2,  3.   In  order  to  demonstrate  the  character 
of  the  results,  the  additional  fractional  explained  variance  by  Test  2 
compared  to  1,  by  Test  3  relative  to  2  and  Test  3  relative  to  1,  in 
the  prognosis  mode  is  summarized  in  Table  4.   Since  the  corresponding 
dependent-sample  statistics  are  to  be  considered  optimal  at  the  same 
time  as  the  prognosis  verified, the  identical- time  dependent-data 
statistics  are  listed.   It  should  be  recalled,  however,  that  the 
prognosis -mode  associated  with  the  independent-data  sample  employs 
the  coefficient  matrix  determined  at  time  t+1. 
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TABLE  4.   Added  percentage  explained  variance  by  Test  2  relative 
to  Test  1,  by  Test  3  relative  to  2,  and  Test  3  relative  to  1.   The 
comparative  results  are  shown  corresponding  to  the  same  time-line  (t+2) 
in  Table  3  by  means  of  the  parameters  (5^  added  explained  variance) 

2    2  2    2 

100 (R^  -  R  ),  etc.  Time-means  of  100 (R  -  R  ),  etc.,  are  taken  over 

the  13  times  t+2=3, . . . ,15. 


Test  (2-1) 

Test  (3-2) 

Test  (3-1) 

100 (R^  -  R^) 

100 (r2  -  r2) 

100 (R^  -  R^) 

Dep.   Indep. 

Dep.   Indep. 

Dep.   Indep. 

1.05  0.36 

0.36  0.23 

2.37  0.66 

(t+2  =  12) 

(t+2  =  8) 

(t+2  =  12) 

8.62  6.59 

2.28  2.29 

10.14  7.54 

(t+2  =  12) 

(t+2  =  3) 

(t+2  =  14) 

3.58  2.30 

1.13  0.83 

4.71  3.13 

MIN 
MAX 
MEAN 


In  Table  4  is  shown  the  minimum  and  maximum  of  percentage  explained 
variance  by  proceeding  from  the  use  of  Eqs.  (5)  to  (7)  in  the  time 
series  of  data-arrays  A.   The  results  are  shown  both  for  the  independent 
data-sample  at  (t+2) ,  as  well  as  the  added  explained  variance  using  the 
series  of  specification  equations  valid  at  the  same  time.   In  the  case 
of  minimum  and  maximum  added  variance,  the  period  number  (t+2)  observed 
from  Table  3  has  been  noted  under  the  corresponding  event  in  Table  4. 

rt  is  apparent  that  the  use  of  the  five-predictor  equation  (7) 
and/or  (10)  yields  consistent  additional  explained  variance  in  the 
prxignostic  mode  over  both  one-  and  three-predictor  equations. 
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6,   Some  remarks  concerning  the  regression-update  procedure. 

Examination  of  Tables  3  and  4  indicates  the  general  nature  of 

criteria  for  a  "successful"  prognosis  by  use  of  Eq.(lO).  Tempor- 

2 
arily,  the  term  "successful"  will  be  regarded  as  R_(t+2)  upon 

prognosis  as  exceeding  84/^  of  the  percentage  explained  variance. 

Guided  by  Table  4  but  with  specific  reference  to  the  case-by- 
case  listings  in  Table  3,  a  typical  profile  for  a  successful  prog- 
nosis may  be  sketched  in  broad  brush  form.   The  prime  requirement 

2 

appears  to  be  that  R  (t+2)  ^  0.795  upon  specification. 

At  the  same  time,  the  added  percentage  explained  variance  upon 
specification  when  the  T  -variables  are  included  should  be  of  the 
order  3  to  5^,  depending  upon  how  much  R  (t+2)  exceeded  0.795.   The 
added  explained  variance  upon  inclusion  of  the  T- -variables  does  not 
appear  to  be  so  crucial  using  the  data  of  NIMBUS  II,  although  some 
additional  explained  variance  ranging  from  0,5  to  1.0/^  seems  to  be 
a  reasonable  expectation  (Table  4) . 

Some  examples  of  relatively  inefficient  prognoses  occurred  at 

times  t+2  =4,  14,  15  (Table  3)  when  the  values  of  the  simple  lag- 

2    2 

correlation  coefficient-squared,  R  =  R  [Zc(t+2),  Z-(t+l)],  were 

0.7441,  0.6711,  0.7350,  respectively.  One  does  not  know,  a  priori, 

2 

at  time  t+2  what  this  value  of  R  (t+2)  will  prove  to  be.   However, 

2 

one  method  of  testing  for  a  tentative  value  of  R  (t+2)  would  be  to 

compute  Zt.(t+2),  with  the  update  radiative  fields  in  hand,  and  then 
computing  the  simulated  R  (t+2).   An  examination  of  Test  1  from 
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2 

Table  3  suggests  that  the  proper  trend  in  R  (t+2)  would  thus  be 

indicated.   For  the  most  part,  however,  the  results  upon  prognosis 
(independent  data)  in  Table  3  with  the  exception  of  t+2  =  14,  15, 
were  encouraging. 

Two  examples  of  prognoses  and  their  comparisons  with  analyses 
at  the  same  verifying  times  are  given  in  Section  7. 
7..  Some  graphical  prognoses  and  their  verifications. 

In  this  section,  two  experiments  are  described  based  upon  the 
use  of  the  five-predictor  Eq.(lO)  derived  from  the  coefficient-array 
specified  at  time  t+1  and  applied  to  a  prognosis  verifying  at  time 
t+2.   The  two  prediction  equations  generated  correspond  to  the  use 
of  the  coefficient-matrices 

taken  from  Table  5.  These  were  then  applied  to  the  data  arrays  at 
t  =  06  and  12,  respectively  for  the  computation  of 
z'^(t+2=06)   and  z'^(t+2=12) 

Contours  were  drawn  of  intervals  of  60,0  gpm,  over  the  map  fields 
shown  in  Fig.  3  and  6.   These  two  maps  depict  the  prognoses  for 

z'^(I,J,t=06)   and  z'^(I, J, t=12) 

are 
and/to  be  verified  against  the  actual  contour  analyses  shown  in  Figs, 

2  and  5  respectively  at  times  t=06  and  t=12.   The  corresponding 

cs,/ 

error  fields  (Z  -  Z)  are  shown  in  Figs.  4  and  7. 

In  the  case  of  Z^(t=06),  Table  3  shows  that  fractional  explained 
variance  was  .8944  for  the  independent  (verifying)  data.   This 


21 


22 


CO 


•p-l 

P4 


CO 

o 

C 
60 
O 
U 

a 

0) 


6 
o 
u 

M 


ON 


0) 


o 
o 


4J 
CO 


•T3 
I-( 
(U 
•H 
M-l 

o 

u 
<u 

9) 

s: 

H 


M 


23 


corresponds  to  an  R.M. S.  error  of  56.628  gpm.   At  t=12,  the  verify- 
ing  field  Z  (12)  explained  .8616  of  the  variance  of  Z  (12),  with  an 
R.M.  S.  error  of  57.549  gpm.   These  two  examples  were  selected  to 
show  that  the  prognostic  update  procedure  outlined  here  seems  to 
behave  rather  well  throughout  the  time  series. 

Apart  from  the  quoted  R.M. S.  standard  errors  in  Z^  for  the  two 
prognostic  examples,  the  prognoses  nevertheless  give  good 
definition  of  the  verifying  large-scale  map  features.  A  comparison 
of  Fig.  3  with  Fig.  2  gives  rather  good  evidence  of  this  statement. 
The  geographic  distribution  of  the  actual  troughs  and  ridges  in  the 
westerlies  in  Fig.  2  is  clearly  identifiable  in  the  prognoses  of 
Fig.  3.  For  example,  the  ridges  over  western  North  America,  and  over 
western  Europe  are  clearly  evident  in  both  Fig.  2  and  3,  as  is  the 
low  in  the  Aleutian  area.  The  rather  complicated  wave  structure  over 
Eurasia  (Fig.  2)  has  been  well  recaptured  in  Fig.  3, 

Comparison  of  Fig.  5  with  Fig.  6  shows  that  the  latter  depicts  most 
of  the  features  of  the  verifying  map  at  t=12.   Among  these  are  the 
tilted  SW-NE  ridge  over  western  North  America,  and  a  second  ridge 
extending  S-N  in  over  the  Atlantic  into  the  Greenland  area.   In  Fig.  6 
troughs  over  western  Europe  and  eastern  North  America  are  closely 
comparable  to  similar  features  on  the  verifying  map  Fig,  5.   The  well 
defined  ridge  predicted  over  Western  Siberia  in  Fig.  6  is  likewise 
identified  in  Fig.  5,  Fig.  6   shows   considerable  more  trough-ridge 
amplitude  over  the  Pacific;   however,  this  is  an  area  of  the  verifying 
analysis  (Fig.  5)  where  much  of  the  daily  perturbation-amplitude 
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appears  to  have  been  smoothed  out  by  the  five-day  time -aver aging. 
The  error  chart.  Fig.  7,  has  only  a  small  proportion  of  its  area 
covered  by  isopleths  of  error -magnitude  in  excess  of  90  gpm,  although 
an  extreme  value  of  +210  gpm  is  present  in  N.E.  Siberia. 

On  the  whole,  however,  both  prognostic  maps  show  features  with 
good  correlation  to  those  of  the  verifying  maps,  where  the  latter 
have  well-defined  amplitudes. 
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additional  predictors,  the  5-day  mean  composited  gridpoint  values  of  the  NIMBUS 
II  MRIR  equivalent  black  body  temperatures  in  the  water-vapor  channel  (denoted 
T. )  and  the  window  channel  (denoted  T.)  were  introduced  for  both  analyses  times 
t  and  t+1.   The  five-predictor  regression  equations  thus  developed  for  Z^(I,J, 
t+1)  proved  to  have  substantial  statistical  significance,  both  from  the  stand- 
point of  the  specification  and  of  prediction  of  Z^(I,J,t+2)  valid  five  days 
later  (t+2  =  3,  ...,  15).   In  the  latter  connection,  the  prognosis  of  Z^(t+2) 
was  significantly  improved  when  all  five  predictors  were  included,  as  compared 
with  the  case  when  the  data  of  either  radiometric  channel  T^(I,J)  or  T2(I,J)  at 
times  t+1,  t+2  was  deleted  from  the  prediction  equations. 
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